Famly Income,

School Ranking,

and Acceptance Rate

Tony Zhao

AP Statistics, Period 1, PRISMS

Central Question

How does family income affect university acceptance rate in the United States?

Data Source

  • A cencus published by u/powereddeath on r/ApplyingToCollege (A2C)
  • A2C contains 1.2M members, the largest college admissions subreddit
  • The cencus is for 2020 (class of 2024), because the raw data of newer year is not published (didn’t reply to me after contact).
  • Raw data avaialble here, original post here

Overview of the Cencus

ref: https://drive.google.com/drive/folders/1TyPIamNSgPM8LeMRSupOXfkjfz9OyePl

Top States of Residence

  1. California (22%)
  2. New York (8%)
  3. New Jersey (6%)
  4. Texas (6%)
  5. Massachusetts (4%)

Summary

  • The dataset is definitely not representative of the US population, but it is representative of the A2C community.
  • More male than female
  • Much more Asian than national average (48% vs 6%)
  • Very left leaning (liberal + progressive = 65%, right-wing + conservative = 5%)
  • Super high standardized test scores
    • average SAT: 1470, national average is 1050
    • average ACT: 33, national average is 19.4
  • Somehow similar to PRISMS?
  • Rich: 24% of families have income above 250k, whereas national median 80k in 2023

How income affects acceptance rate

  • Initially increases
  • Reaching a peak
  • Then decreases at the last income bracket
  • 95% CI for 1st income bracket is very wide, as sample size is small
  • CI for last income bracket is narrower, as the sample size is larger

Is this trend same across all schools?

  • Different financial aid policies
  • Different competitiveness
  • Private vs public
  • Need-blind vs need-aware
  • Different financial status of the school

Hypothesis: ranking is a major factor

  • Higher ranked schools are more competitive
  • Higher ranked schools have higher tuition
  • Rich students might apply more to highly ranked schools
  • Rich students might meet the criteria of highly ranked schools better

Which ranking system to use?

  • US News & World Report
    • Most popular
    • Focus on US schools
    • Liberal arts colleges are separated from universities
    • But the cencus contains both LACs and universities
  • QS World University Rankings
    • Focus on global schools
    • Not as popular in the US
    • Still no LACs
  • WSJ/Times Higher Education
    • Combining LACs and Universities!!

I should make the schools into different groups based on their rankings, otherwise the sample size of each school will be too small to draw any conclusions, and will result in a lot of noise.

  • \(R^2 = 0.702\) with linear best-fit line
  • Outliers in the two ends of x-axis
  • slope=-226, 226 less students per increase in 10-school ranking group

  • \(R^2 = 0.962\) with exponential best-fit line
  • Seems like the popularity of schools decreases exponentially as the ranking increases
  • rank group 191-200 has only 20 applicants, which is too low

How about dynamic grouping?

  • Given the first group, remember its total applicant
  • Finding the next group that has the most similar number of applicants as the first group
  • Not enough applicants for the last group

Let’s see if the trend changed once we consider school ranking

A little too messy, let’s split it into more graphs

Seems like both high-ranking schools and low-rankings schools consider family income the most, while mid-ranking schools consider family income less.

Verifying the Observed Trend Using Statistical Tests

Chi-Squared Test of Independence

  • Null Hypothesis: Family income and acceptance rate are independent
  • Alternative Hypothesis: Family income and acceptance rate are dependent
  • p-value represents the probability of observing the data if the null hypothesis is true

But chi-square dosn’t tell us the direction of the relationship.

Regression Slope Test

  • Null Hypothesis: The slope of the regression line is zero (no relationship)
  • Alternative Hypothesis: The slope of the regression line is not zero (there is a relationship)
  • p-value represents the probability of observing the data if the null hypothesis is true
  • r-value represents the strength and direction of the relationship

Possible Explanations for the Observed Trend

Need-blind Admissions

  • Top universities are often need-blind:
    • Princeton
    • Harvard
    • Stanford
    • MIT
    • Pennsylvania
    • Cornell
  • In fact, all top 10 universities on the WSJ/Times Ranking are need-blind
  • These universities are financially capable of admitting students without considering their financial status

Wholistic Review

Top universities often practice wholistic review, which means they consider more than just test scores and GPA. They considers the difficulties that financially challenged students face, and how they overcome those difficulties.

For example, both Princeton and MIT marked first generation and work experience as “considered” in their common data set (CDS).

The richest students are not reflected in the data

According to this New York Times article based on this research, students with top 0.1% parent income are 2.2x more likely to be admitted to top schools than average.

Financial is a huge factor in atheletes recruitment

Public Universities are Less Biased on Family Income

Thank You for Listening!